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Abstract 

We present an algorithm which computes the multilinear factors 
of bivariate lacunary polynomials. It is based on a new Gap the- 
orem which allows to test whether P(X) = Ey=i 0;X a ''(l + X)b is 
identically zero in polynomial time. The algorithm we obtain is more 
elementary than the one by Kaltofen and Koiran (ISSAC'05) since it 
relies on the valuation of polynomials of the previous form instead 
of the height of the coefficients. As a result, it can be used to find 
some linear factors of bivariate lacunary polynomials over a field of 
large finite characteristic in probabilistic polynomial time. 
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i Introduction 



The lacunary, or supersparse, representation of a polynomial 

P(X 1 ,...,X n ) = £ a i X l' l --- X n' 1 
7=1 

is the list of the tuples (fly, ay, . . . ,a n ,y) for 1 < j <k. This representation 
allows very high degree polynomials to be represented in a concise manner. 
The factorization of lacunary polynomials has been investigated in a series 
of papers. Cucker, Koiran and Smale first proved that integer roots 
of univariate integer lacunary polynomials can be found in polynomial 
time [3]. This result was generalized by Lenstra who proved that low- 
degree factors of univariate lacunary polynomials over algebraic number 
fields can also be found in polynomial time [21]. More recently, Kaltofen 
and Koiran generalized Lenstra's results to bivariate and then multivariate 
lacunary polynomials [10, 11]. A common point to these algorithms is 
that they all rely on a so-called Gap Theorem: If F is a factor of P(X) = 
Ey=i ajX tt i, then there exists k such that F is a factor of both JVLj ajX a i 

and Ey =fco+1 cijX a ). Moreover, the different Gap Theorems in these papers 
are all based on the notion of height of an algebraic number, and some of 
them use quite sophisticated results of number theory. 

In this paper, we are interested in more elementary proofs for some of 
these results. We focus on Kaltofen and Koiran's first paper [10] dealing 
with linear factors of bivariate lacunary polynomials. We show how a Gap 
Theorem that does not depend on the height of an algebraic number can 
be proved. In particular, our Gap Theorem is valid for any field of char- 
acteristic zero. As a result, we get a new, more elementary algorithm for 
finding linear factors of bivariate lacunary polynomials over an algebraic 
number field. In particular, this new algorithm is easier to implement 
since there is no need to explicitly compute some constants from number 
theory, and the use of the Gap Theorem does not require to evaluate the 
heights of the coefficients of the polynomial. Moreover we use the same 
methods to prove a Gap Theorem for polynomials over some fields of pos- 
itive characteristic, yielding an algorithm to find linear factors of bivariate 
lacunary polynomials of the form (uX + vY + w) with uvvo 7^ 0. Finding 
linear factors with u = is N P-hard, and the same is true for linear factors 
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with v = or w = 0. This follows from the fact that finding univariate 
linear factors over finite fields is NP-hard [17, 1, 12]. In algebraic number 
fields we can find all linear factors in polynomial time, even those with 
uvw = 0. For this we rely as Kaltofen and Koiran on Lenstra's univariate 
algorithm [21]. 

Our Gap Theorem is based on the valuation of a univariate polynomial, 
defined as the maximum integer v such that X v divides the polynomial. 
We give an upper bound on the valuation of a nonzero polynomial 

k 

P(X) = Y^ajX tt i(l + X)Pi. 

7=1 

This bound can be viewed as an extension of a result due to Hajos [8, 23]. 
We also note that Kayal and Saha recently used the valuation of square 
roots of polynomials to make some progress on the "Sum of Square Roots" 
problem [16]. 

Lacunary polynomials have also been studied with respect to other 
computational tasks. For instance, Plaisted showed the NP-hardness of 
computing the greatest common divisor (GCD) of two univariate integer 
lacunary polynomials [25], and his results were extended to finite fields [27, 
15, 10]. On the other hand, some important special cases were identified 
for which the GCD of two lacunary polynomials can be computed in 
polynomial time [4]. Other efficient algorithms for lacunary polynomials 
have been recently given, for instance for the detection of perfect powers [6, 
7] or interpolation [13]. 

Acknowledgments. We wish to thank Sebastien Tavenas for his help 
on Proposition 8, and Erich L. Kaltofen for pointing us out a mistake in 
Theorem 19 in a previous version of this paper. 

2 Bound on the valuation 

In this section, we consider a field K of characteristic zero and polynomials 
over K. 

Theorem 1. Let P = ^ =lfl; -X^(l + X)^ with cc x < ■ ■ ■ < a k . If P is not 
identically zero then its valuation is at most maxy(ay + ( k+ l~^))- 
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A lower bound for the valuation of P is clearly oc\ (and it is attained 
when ocj_ > dl\ for instance). If the family (X a i(l + X)^) 1< y<^ is linearly 
independent over K, the upper bound we get is actually ol\ + (2): At most 

the first (2) lowest-degree monomials can be cancelled. If ccj = cc\ for all 
/, Hajos' Lemma [8, 23] gives the better bound <x.\ + (k — 1). (This bound 
can be shown to be tight by expanding X fc_1 = (—1 + (X + 1)) 1 with 
the binomial formula.) This is not true anymore when the afs are not all 
equal. One can show that the valuation can be as large as oc\ + (2k — 3) 
(see Proposition 8). The exact bound remains unknown, and whether this 
bound is still linear as in Hajos' Lemma or quadratic is open. 

Our proof of Theorem 1 is based on the so-called Wronskian of a 
family of polynomials. This is a classical tool for the study of differential 
equations but it has recently been used to bound the valuation of a sum 
of square roots of polynomials [16] and also to bound the number of real 
roots of some sparse-like polynomials [18]. 

Definition 2. Let f\, . . . , fa G K[X]. Their Wronskian is the determinant of 

the Wronskian matrix 
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The main property of the Wronskian is its relation to linear indepen- 
dence. The following result is classical (see [2] for a simple proof of this 
fact). 

Proposition 3. The Wronskian 0//1, . . . ,fais nonzero if and only if the f/s are 
linearly independent over K. 

The next two lemmas are our main ingredients to give a bound on the 
valuation for P, using a bound on the valuation of some Wronskian. 

Lemma 4. Let f\,...,fk G K[X]. Then 

Val(W(A //c))>EVal(/ 7 )-Q. 
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Proof. Each term of the determinant is a product of k terms, one from 
each column and one from each row. The valuation of such a term is at 
least Ey Val(/y) - Efci i since for all Val(/- W ) > Val(/ ; ) - i. The result 
follows. □ 

We can slightly refine the bound in this lemma. The term of valuation 
Ey Val(/y) — (2) in the Wronskian is indeed the determinant of the matrix 

made of the smallest degree monomials of each /• . This determinant 
can vanish. In fact, one can easily see that this is the case if two f/s 
have the same valuation since this yields two proportional columns in the 
matrix. To use this idea more generally, consider that the f/s are ordered 
by increasing valuation. We define a plateau to be a set {fj Q , . . . ,fj + s } such 
that for < t < s, Val(/y 0+f ) < Val(/y ) + * - 1. The f/s are naturally 
partitioned into plateaux. Suppose that there are (m + 1) plateaux, of 
length po, • ■ ■ , p m respectively, and let fj Q , ... , f; their respective first 
elements. Generalizing the previous remark to plateaux, it can be shown 
that 

vaW!,...,/,)) > t [vmUn) + (2)) - (*)■ CO 

This bound is at least as large as in the lemma. If all the f/s have a different 
valuation, then the bound is equal to the bound stated in the lemma since 
there are in this case k plateaux, each of length 1. On the other side, if 
they all have the same valuation a, there is one plateau of length k and the 
bound is Val(W(/i, ...,f{)) >ka. We investigate the implications of this 
refinement after the proof of Theorem 1. 

Lemma 5. Let fj = X a i(l + X)^, 1 < / < k, such that a.j,^j > kfor all ). If 
the ffs are linearly independent, then 

Val(W (/!,... JO) <!>/• 

;'=i 

Proof. By Leibniz rule, for all i, j 

ffOQ = t (!) ("/M/5;);- f X a >- f (l + Xfr^ ( 2 ) 
f=o W 
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where (m) n = m(m — 1) • • • (m — n + 1) is the falling factorial. Since 

ctj — t > ccj — i and /3y — i + t > — i for all f , 

ff\x) = x^-'(i + x/h x £ (!) ( a; -) ( (^ / ) / _ i x < - 1 (i + xy. 

t=o W 

Furthermore, since a,- >k>i, we can write X a H = X*r*X* _i and since 

fij >k> i, (1 + X)ft -i = (1 + X)^- fc (l + X) fc "'. Thus, X^~ fc (l + X)^~* 
is a common factor of the entries of the /-th column of the Wronskian 
matrix, and X fc_ '(l + X) k ~ l is a common factor of the entries of the z'-th 
row. Together, we get 

W(/x,. . = x£/ a / - ®(l + X)^ - ® det(M) 
where the matrix M is defined by 

My = EQw*(Wi-*x i - t (i + x) t . 

The polynomial det(M) is nonzero since the //s are supposed linearly 
independent and its degree is at most (~). Therefore its valuation cannot 
be larger than its degree and is bounded by (2). 

Altogether, the valuation of the Wronskian is bounded by a.j — (2) + 

(2) = ° 

Proof of Theorem 1. Let P = J^-o ; -X*^(l + X)^, and let / y = X a /(1 + X)/ 3 /. 
We assume first that ay, jSy > for all /, and that the //s are linearly 
independent. Note that Val(/,) = a.j for all /. 

Let W denote the Wronskian of the ffs. We can replace f\ by P in 
the first column of the Wronskian matrix using column operations which 
multiply the determinant by a\ (its valuation does not change). The matrix 
we obtain is the Wronskian matrix of P ', fj_, ■ ■ ■ , fk- Now using Lemma 4, 
we get 

Val(W) > Val(P) + J] otj - 

y>2 
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This inequality combined with Lemma 5 shows that 

Val(P) < *! + Q . (3) 

We now aim to remove our two previous assumptions. If the f/s are 
not linearly independent, we can extract from this family a basis fa, . . • ,fi d - 

Then P can be expressed in this basis as P = Yi=\ d-lfa- We can apply 

Equation (3) to fa,. . . , fa and obtain Val(P) < oc^ + Q). Since < k, we 

have ;'i + d - 1 < k and Val(P) < ol ]x + ( k+ \~ h ). The value of j x being 
unknown, we conclude that 

Val(P)< max («;+(*+*-')). ( 4 ) 

The second assumption is that ay, j6y > A:. Given P, consider P = 

X fc (l + X) /C P = E/ayX^QL + X)^. Then P satisfies ay,jSy > fc, whence by 

Equation (4), Val(P) < maxy(a ; - + ( k+ l~')). Since Val(P) = Val(P) + k and 
ccj = oij + k, the result follows. □ 

Remark 6. In Theorem 1, we can replace (1 + X) by (uX + v) for any 
u, v 7^ 0. Indeed, we can write uX + v = v(^X + 1) and then use the 
change of variables Y = |X. This gives us a polynomial of the same form 
as in the theorem, with the same valuation as the original one. 

Remark 7. Theorem 1 does not hold in positive characteristic as shown 
by the equality (1 + X) 2 " + (1 + X) 2 " +1 = X 2 "(l + X) mod 2. Section 5 
investigates the case of positive characteristic in more details. 

We argued after Lemma 4 that it can be refined. In the previous proof, 
it is used with P, fa, . . . , // c . If all the fa's have the same valuation a, 

Equation (1) gives the bound Val(W) > Val(P) + ((k - l)cc + ( k ~ v )) - 

(2), whence Val(P) < oc + (k — 1). In this case, replacing Lemma 4 by 
Equation (1) gives us a new proof of Hajos' Lemma, with the correct 
bound. 

On the other hand, if the f/s have pairwise distinct valuations, Equa- 
tion 1 gives the same bound as Lemma 4. Yet in this case Lemma 5 can 
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be refined to obtain the bound Val(W) < YLj&j ~ (2)- Again, we find the 
optimal bound for the valuation, that is Val(P) = oc\ here. 

The refinement of Lemma 4 alone is not sufficient to improve Theorem 1 
in the general case. To this end, one needs to improve Lemma 5 as well. As 
already mentioned, it is an open problem to determine the best achievable 
bound for Theorem 1. The next proposition shows that it cannot be as low 
as in Hajos' Lemma. 

Proposition 8. For k > 3, there exists a linearly independent family of polyno- 
mials (X K i(l + X)^')i<y<fc, and a family of rational coefficients 
( a j)i<j<k sucn that the polynomial 

k 

P(X) = £cyX*/(l + X)^ 

;'=i 

is nonzero and has valuation ol\ + (2k — 3). 

Proof. A polynomial that achieves this bound is 

k 

P k {X) = -1 + (1 + X) 2k+3 - £ ajX 2 ' +1 (l + X) k+1 -', 

7=0 

where 

- 2fc + 3 /fc + l + A 
a i~ 2j + l{k + l-j)- 

We aim to prove that Pfc(X) = X 2k+3 . Since it has (k + 3) terms and ol\ = 0, 
this proves the proposition. To prove the result for an arbitrary value of 
1x1, it is sufficient to multiply P^ by some power of X. 

It is clear that P^ has degree (2k + 3) and is monic. Let [X m ]Pj- be the 
coefficient of the monomial X m in P k . Then for m > 

™=(*: 3 )-bC-«-\} 

We aim to prove that [X m ]P k = as soon as m < 2k + 3. Using the 
definition of the fly's, this is equivalent to proving 

X 2k + 3 /fc + l + A / k + l-j \ = (2k + 3 
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To prove this equality, we rely on Wilf and Zeilberger 's algorithm [24], 
and its implementation in the Maple package EKHAD of Doron Zeilberger 
(see [24] for more on this package). The program asserts the correctness of 
the equality and provides a recurrence relation satisfied by the summand 
that we can verify by hand. 

Let F(rn,j) be the summand in equation (5) divided by ( 2k ^ 3 )- We thus 
want to prove that Y^=q F(m,j) = 1- The EKHAD package provides 

2j(2j + l){k + j + 2-tn) 
[ ,]) (2fc + 3-m)(2/-m) 

and claims that 



mF(m + — mF(m,j) 

= F{m,j + l)R(m,j + 1) - F(m,j)R(m,j). (6) 

In the rest of the proof, we show why this claim implies Equation (5), and 
then that the claim holds. 

Suppose first that Equation (6) holds and let us prove Equation (5). If 
we sum Equation (6) for j = to k, we obtain 

k 

m(J^F(m + l,/) - F(m,j)) 
j=o 

= F(m,k + l)R(m,k + l) - F(m,0)R(m,0). 

Since R(m,0) = and F(m,k + 1) = 0, YLj F{m r j) is constant with respect 
to m. One can easily check that the sum is 1 when m = 2k + 2. (Actually 
the only nonzero term in this case is for j = k.) Therefore, we deduce that 
for all m <2k + 3/ J^jF(m,j) = 1, that is Equation (5) is true. 
To prove Equation (6), note that 

F(m + 1,;) _ (; + fc + 2-m)(m + l) 
F(m,j) ~ (m-2j)(2k + 3 -m) 

and 

F(m,; + 1) _ (k + 2- j)(m - 2] - l)(m - 2j - 2) 
F(m,j) ~ (2j + 2)(2j + 3)(j + k + 3-m) ' 

x The bound on m is given by the fact that R{m,j) is undefined for m = 2k + 3. 
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Therefore, to prove the equality, it is sufficient to check that 

j+k+2-m m+1 
° = m m-2j 2k + 3-m ~ m + R ^ 

(k + 2-j)(m-2j-l)(m-2j-2) 

(2j + 2) (2j + 3)(j + k + 3-m) K[m ' ] + L) ' 

This is done by a mere computation. □ 

From Theorem 1, we can deduce the following Gap Theorem. 

Theorem 9 (Gap theorem). Let P = YJi=i ajX a i(uX + v)Pi with u, v 7^ and 
#7+1 > &jr < j < k. Assume that there exists £ such that 

^ +1> l<g( aj+ C + 2" ; ))' (7) 
Then P is identically zero if and only if the polynomials Y^=\ djX a j(uX + v)&i 
and jjj=e+i ajX a i(uX + v)*l are both identically zero. 

In particular, the smallest £ satisfying (7) is the smallest £ satisfying 

'£ 

oce+i > dis- 
proof Let Q = Ey=i ajX K i(uX + vfi and R = P - Q. Suppose that Q is not 

identically zero. By Theorem 1, its valuation is at most maxy(ay + ( £+ 2 •'))• 
Since ccj > ocg + i for / > I, the valuation of R is at least ag + i > maxy(ay + 

( e+ 2 "■''))■ Therefore, if Q is not identically zero, its monomial of lowest 
degree cannot be canceled by a monomial of R. In other words, P = Q + R 
is not identically zero. 

For the second part of the theorem, consider the smallest £ satisfying 
Equation (7). It is clear that ol^+\ > d\ + ( 2 ). Moreover for all / < £, ccj + \ < 

max i < ; (« i + (^" i )). We now prove by induction on j that ctj < 0L\ + (•' 2 ) 
for all j < £. This is obviously true for / = 1. Let j < £ and suppose that 
for all i < j, 0L[ < dl\ + i 1 ^ 1 )- Then 

otj+i < max (&i + ft : + 1 l X\ < &i + max ( ( 1 l \ + ^ ( ' 



i<j \ \ 2 J J i<j \ V 2 J \ 2 

To conclude, we remark that (~^) + ( ; ~ ( 2 _1) ) < ® for all i < j. □ 
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It is straightforward to extend this theorem to more gaps. The theorem 
can be recursively applied to Q and R (as defined in the proof). Then, if 
P = Pi + . . . + P s where there is a gap between Pt and Pt+\ for 1 < t < s, 
then P is identically zero if and only if each Pt is zero. 

3 Algorithms 

In this section, we prove that there exists a deterministic polynomial- time 
algorithm to test if a polynomial of the form 

k 

P =J^ajX a i(uX + v)Pi, (8) 
7=1 

is identically zero and give a deterministic polynomial-time algorithm to 
compute the linear factors of a lacunary bivariate polynomial. The size of 
P is defined by 

k 

size(P) = size(w) + size(z?) + ^(size(ay) + log(a ; /3 7 )). (9) 

;=i 

The algorithms use Lenstra's algorithm [21] or a variant of it for treat- 
ing some special cases. This use of Lenstra's algorithm implies some 
restrictions on the field K in which the coefficients of the polynomials 
lie. In this section, K is an algebraic number field, and it is represented 
as K = Q[£]/ ((p) where cp £ Z[£] is a monic irreducible polynomial. El- 
ements of K are given as vectors in the basis (1,£,. . .,^ degl? ' _1 ). That is 
f or e G K, e = (eg,. . . ,ed e g<p-i) with et = nt/dt for each t where nt,dt G Z. 
Then 

size(e) = log(nidi) H h log(n deg(? ,_ 1 d d eg ( p_i). 

The size of a polynomial defined as above is then approximately the 
number of bits needed to write down its binary representation. 

Theorems 10 and 11 were already proven in [10]. We give here new 
proofs based on our Gap Theorem. The structures of the algorithms we 
propose are the same as in [10]. The only differences are the ones induced 
by the use of a different Gap Theorem. This implies some differences in 
terms of the complexity that are discussed at the end of this section. 
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Theorem 10. There exists a deterministic polynomial-time algorithm to decide if 
a polynomial of the form (8) is identically zero. 

Proof. We assume without loss of generality that a,- + i > cij for all j and 
ol\ = 0. If DL\ is nonzero, X* 1 is a factor of P and we consider P/X* 1 . 

Suppose first that u = 0. Then P is given as a sum of monomials, and 
we only have to test each coefficient for zero. Note that the ocfs are not 

distinct. Thus the coefficients are of the form YLi djV^i. Lenstra [21] gives an 
algorithm to find low-degree factors of univariate lacunary polynomials. 
It is easy to deduce from his algorithm an algorithm to test such sums for 
zero. A strategy could be to simply apply Lenstra's algorithm to YLj djX^ 
and then check whether (X — v) is a factor, but one can actually improve 
the complexity by extracting from his algorithm the relevant part. The 
case v = is similar. 

We assume now that u, v 7^ 0. We split P into small parts P = P\ + 
■ ■ ■ + P s , such that according to the Gap Theorem, P is identically zero if 
and only if each part Pt is identically zero. Formally, let l\, . . . , I s be the 
(unique) partition of {1, . . . , k} into intervals defined recursively as follows. 
Let 1 € l\. For 1 < j < k, suppose that {1,. . .,;'} has been partitioned 
into l\, . . . , It, and let U be the smallest element of It- Then (/ + 1) G It 
if K.j + i < ct if + (•'~2 +1 ), and (/ + 1) G I t +i otherwise. The polynomials 
Pt = YLjeh a jX a i(l + X)Pi satisfy the conditions of Theorem 9. Therefore, 
we are left with testing if the P/s are identically zero. Moreover, X Ui t 
divides Pt for each t and it is thus equivalent to be able to test if each 
Pt/X a 't is identically zero. 

To this end, let Q be a polynomial of the form (8) satisfying ol\ = 
and K.j + i < Q for all /. In particular, < ( k ^ 2 1 ). Consider the change of 
variables Y = uX + v. Then 



is identically zero if and only if Q(X) is. We can express Q(Y) as a sum of 
powers of Y: 



k 



Q(X) = E 



aj U - a i(Y -v)*iYh 



Q0O = EE 



i=i £=0 




iU_ v y Y *j+Pr l . 
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There are at most fc( fc 2 1 ) = C(^ 3 ) monomials. Then, testing if Q(Y) is 
identically zero consists in testing each coefficient for zero. Moreover, each 
coefficient has the form YLj Ci) a j u ~ U ' (~ V Y' where the sum ranges over at 

most k indices. Since L, ccj < (2*) for all /, the terms in these sums have 
polynomial bit-lengths. Therefore, the coefficients can be tested for zero in 
polynomial time. 

Altogether, this gives a polynomial-time algorithm to test if P is identi- 
cally zero. □ 

Theorem 11. Let 

k 

P(X,Y) = Y^ajX a )YPj e K[X,Y]. 
7=1 

There exists a deterministic polynomial-time algorithm that finds all the linear 
factors ofP, together with their multiplicities. 

Proof A linear factor of P is either of the form (Y — uX — v) or (X — a). 
To search factors of the form (X — a), we see P as a univariate polynomial 
in Y whose coefficients are univariate polynomials in X. Then, (X — a) is 
a factor of P if and only if it is a factor of all the coefficients of P viewed 
as a polynomial in Y. Lenstra gives an algorithm to compute linear factors 
of univariate lacunary polynomials [21]. Thus, we can find all the factors 
of the form (X — a) and their multiplicities using his algorithm. 

Now (Y — uX — v) is a factor of P if and only if P(X, uX + v) vanishes 
identically. We can assume that u ^ 0. If v = 0, P(X, uX) = E/ ajU^X a i + ^i. 
Therefore, it vanishes if and only if each coefficient vanishes. But a 
coefficient of this polynomial is of the form YLj a j u ^- Testing such a 
coefficient for zero is done in polynomial time using Lenstra's algorithm 
as in the proof of Theorem 10, and there are at most k of them to test. 

Suppose now that u,v 7^ 0. Since P{X,uX + v) is of the form (8), we 
can use our Gap Theorem (Theorem 9) as in the proof of Theorem 10: 
Let P = Yd=\ X a (')P, where each P; is of the form (8) and satisfies oi\ — 
and (Xk < (^^)- Then by Theorem 9, P(X, uX + v) vanishes if and only if 
Pj(X,uX + v) vanishes for every i. Now apply the same transformation 
to each P,, inverting the roles of X and Y. Then each P, can be written 
as the sum E/Li Y^WP^ where each P^ is of the form (8) and satisfies 
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a i — f>\ — an d «fc/j8jfc < (2)' Furthermore, P(X, uX + u) vanishes if 
and only if all the Pa(X, uX + v) vanish. 

Since the P ? /s are low-degree polynomials, and there are at most 
k of them, one can find all their linear factors. This relies on one of 
the numerous deterministic polynomial-time algorithms to factor dense 
multivariate polynomials that appear in the literature, from [9, 20] to 
[5, 19]. By the above discussion, the linear factors of P are exactly the 
linear factors that all the Pa's have in common. Several strategies can be 
used to find these linear factors: Either we search the linear factors of all 
the Pa's and keep only the ones they have in common, or we search the 
linear factors of one particular Pa (for instance the one of smallest degree) 
and test if they are factors of the other Pa's using our PIT algorithm, or 
we compute the gcd of all the Pa's and then search its linear factors. In 
particular, this last solution directly gives the multiplicities of the factors 
of P, since it is the same as their multiplicities in the gcd. □ 

As Kaltofen and Koiran's algorithm [10], our algorithm uses Lenstra's 
algorithm for univariate lacunar y polynomials [21] to find univariate 
factors of the input polynomial. To compare both algorithms, let us thus 
focus on the task on finding truly bivariate linear factors, that is of the 
form (Y — uX — v) with uv 7^ 0. 

A first remark concerns the simplicity of the algorithm. The compu- 
tation of the gap function is much simpler in our case since we do not 
have to compute the height of the coefficients. This means that the task 
of finding the gaps in the input polynomial is reduced to completely 
combinatorial considerations. 

Both our and Kaltofen and Koiran's algorithms use a dense factoriza- 
tion algorithm as a subroutine. This is in both cases the main computa- 
tional task since the rest of the algorithm is devoted to the computation of 
the gaps in the input polynomial. Thus, a relevant measure to estimate 
the complexity of these algorithms is the maximum degree of the polyno- 
mials given as input to the dense factorization algorithm. This maximum 
degree is given by the values of the gaps in the two Gap Theorems. In 
our algorithm, the maximum degree is ( 2 ). In Kaltofen and Koiran's, it 
is 0{k\ogk + kloghp) where hp is the height of the polynomial P and the 
value log(Tzp) is a bound on the size of the coefficients of P. For instance, if 
the coefficients of P are integers, then hp is the maximum of their absolute 
values. Therefore, our algorithm has a better asymptotic complexity as 



soon as the size of the coefficients exceeds the number k of terms. Fur- 
thermore, the hidden constant in the bound for Kaltofen and Koiran's 
algorithm is only known to be bounded by approximately 15 while the 
corresponding constant in our case is 1/2. 

Note that an improvement of Theorem i to a linear bound instead 
of a quadratic one would give us a better complexity than Kaltofen and 
Koiran's algorithm for all polynomials. Finally, it is naturally possible to 
combine both Gap Theorems in order to obtain the best complexity in all 
cases. 

4 Generalizations 

In this section, we aim to prove some generalizations of the results obtained 
in Sections 2 and 3. The field K is still supposed to be an algebraic number 
field as in Section 3, unless otherwise stated. 

Our first generalization shows that the identity test algorithm of Theo- 
rem 10 can be extended to a slightly more general family of polynomials. 
Namely, the linear polynomial (uX + v) can be replaced by any 2-sparse 
polynomial. 

Theorem 12. Let P = YHj=\ djX a i (uX d + v)^K There exists a deterministic 
polynomial-time algorithm to decide if the polynomial P is identically zero. 

In the theorem, (uX d + v) could be replaced by the seemingly more 
general expression (uX d + vX d ') with d > d' > 0. Yet, in this case 
we can factor out X d . A term X a '(uX d + vX d )n can thus be written 
X«j+d Pj( u x d ~ d ' + v)Pi. This has the same form as in the theorem, replac- 
ing a.j by {oij + d'fij) and d by (d — d'). 

The size of the polynomial in the statement of the theorem is defined 
as in Equation (9) of Section 3 with the additional term log d in the sum. 
This means that the complexity of the algorithm is still polylogarithmic in 
the degree. 

Proof. For all j we consider the Euclidean division of otj by d: otj = qjd + ry 
with Tj < d. We rewrite P as 

k 

P=YL ajX r i(X d yi(uX d + vfi. 
7=1 
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Let us group in the sum all the terms with a common r,. That is, let 



p.(Y) = £ ajYij(uY + v)Pj 

\<j<k 

Tj=i 

for < / < d. We remark that regardless of the value of d, the number 
of nonzero P/s is bounded by k. We have P(X) = E?=o X f P / (X i ). Each 
monomial X* of X ! Pj(X d ) satisfies a = z' mod d. Therefore, P is identically 
zero if and only if all the P/s are identically zero. 

Since each P, is of the form (8), and there are at most k of them, we can 
apply the algorithm of Theorem 10 to each of them. □ 

We now state a generalization of Theorem 1. A special case of this gen- 
eralization is used in the following to extend our factorization algorithm 
of Theorem 11. It is not known whether the most general version of the 
theorem can be used to further extend our algorithms to be able to find 
small-degree factors of lacunary polynomials. 

Note that this result holds whatever field K of characteristic zero is 
considered. 

Theorem 13. Let G Z+ xfc and 

k m 

p =L a iUft ii eK t x ]' 

;=1 i=l 

where the degree of f e K[X] is djfor all i. Let £ £ K and denote by U{ the 
multiplicity of £ as a roof 0//;. T/zen f/te multiplicity Up(^) of £ zzs a root" o/P 
satisfies 

MO < max J] ^flty + (d* - ^) ^ + ^ ^ • 

A proof of this theorem is given in Appendix A. Note that it can be 
stated in the more general settings of rational exponents a/y. It can then be 
seen as a generalization of a result of Kayal and Saha [16, Theorem 2.1]. 

The following corollary, used to find multilinear factors of bivariate 
lacunary polynomials, is a direct consequence of the theorem. 

Corollary 14. Let P = £y=i ajX a i{uX + v)?i(wX + f) 7 >, uvwt ^ 0. If P is 
nonzero, its valuation is at most maxi<y<^(ay + 2( /c+ 2~-')). 
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We now describe how to use this corollary to get a new factorization al- 
gorithm. Compared to Theorem 11, we are now able to find the multilinear 
factors instead of the linear ones. 

Theorem 15. Let P = Ey = i ajX a 'Y^L There exists a deterministic polynomial 
time algorithm to compute all the multilinear factors ofP, with multiplicity. 

Proof sketch. The proof goes along the same lines as the proof of Theo- 
rem 11. Suppose that XY — (aX — bY + c) is a factor of P. Then the rational 
function P(X, a £^£ ) vanishes identically. Let us assume for simplicity that 
a, b, c 7^ 0. (The other cases can be handled separately, as in the proof of 
Theorem 11.) Let 

a Y -I- r ^ 

Q(X) = (X + fc) max '^P(X / ^^) = £ajX a i(aX + c)Pi(X + b)V 

;=1 

where jj = maXj(/3/) — jSy. Then Q is a polynomial and it vanishes if and 
only if the rational function P(X, fl jf_^ c ) does. By Corollary 14, if Q is 

nonzero its valuation is at most maxy(o:y + 2 ( fc+ 2 "•')). We can deduce a Gap 
Theorem: For 1 < Icq < k, let 

h 

Q (X) = ^a^{aX + cfi{X + byi 
;=i 

and Q 1 = Q- Q . Suppose that a: 7co+1 > max^^a,- + 2( fc ° + 2 1 " J ')). Then 
Q vanishes identically if and only if Qq and Q\ both vanish identically. 
Hence, XY — (aX — bY + c) is a factor of P if and only if it is a factor of 
both Pq and Pi, defined by analogy with Qq and Ch: Po is the sum of the 
fco first terms of P and Pi the sum of the (k — ko) last terms. 

This proves that P can be written as a sum of P ; /s as in the proof 
of Theorem 11 such that the multilinear factors of P are the common 
multilinear factors of the P,/s, and such that each P,£ is of the same form 
as P and satisfies a^, (5^ < 1(jf~j_ 1 ). It thus remains to find the common 
multilinear factors of some low-degree polynomials. Since there are at 
most k of them, this can be done in polynomial time. □ 
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5 Positive characteristic 



As mentioned earlier, Theorem 1 does not hold in positive characteristic. 
We considered the polynomial (1 + X) 2 " + (1 + X) 2 " +1 = X 2 "(X + 1) in 
characteristic 2. It only has two terms, but its valuation equals 2 M . There- 
fore, its valuation cannot be bounded by a function of the number of 
terms. Note that this can be generalized to any positive characteristic. In 
characteristic p, one can consider the polynomial £f =1 (l + X)P" + '. 

Nevertheless, the exponents used in all the examples depend on the 
characteristic. In particular, the characteristic is always smaller than the 
largest exponent that appears. We shall show that in large characteristic, 
Theorem l still holds. This contrasts with the previous result [10] that uses 
the notion of height of an algebraic number, and which is thus not valid 
in any positive characteristic. 

In fact, Theorem l holds as soon as W(/i, . . . does not vanish. The 
difficulty in positive characteristic is that Proposition 3 does not hold 
anymore. Yet, the Wronskian is still related to linear independence by the 
following result (see [14]): 

Proposition 16. Let K be afield of characteristic p and f\,...,f\ £ K[X]. Then 
f\, ... , f\ are linearly independent over K[X p ] if and only if their Wronskian 
does not vanish. 

This allows us to give an equivalent of Theorem 1 in large positive 
characteristic. 

Theorem 17. Let P = jjj =1 a ; -X"/(l + X)ft £ K[X] with cc x < ■ ■ ■ < cc k . If 
the characteristic p o/K satisfies p > maxy(ay + /3y), then the valuation of P is 
at most maxj(a.j + ( k+ 2 "O), provided P does not vanish identically. 

Proof. Let ft = X a /(1 + X)^ for 1 < j < k. The proof of Theorem 1 has 
two steps: We prove that we can assume that the Wronskian of the f/s 
does not vanish, and then under this assumption we get a bound of the 
valuation of the polynomial. The second part only uses the non-vanishing 
of the Wronskian and can be used here too. We are left with proving 
that the Wronskian of the f/s can be assumed to not vanish when the 
characteristic is large enough. 

Assume that the Wronskian of the f/s is zero: By Proposition 3, there is 
a vanishing linear combination of the f/s with coefficients bj in K[X p ]. Let 
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us write bj = E hi^ v - Then Ei X ' P hjfj = °- Since de S/; = a ; + I 6 /' < V> 
Yj bi,jfj = f° r a ll *• We have thus proved that there is a linear combination 
of the fj's equal to zero with coefficients in K. Therefore, we can assume 
we have a basis of the f/s whose Wronskian is nonzero and use the same 
argument as for the characteristic zero. □ 

Based on this result, the algorithms we develop in characteristic zero 
for PIT and factorization can be used for large enough characteristics. 
Computing with lacunary polynomials in positive characteristic has been 
shown to be hard in many cases [27, 15, 17, 10, 1, 12]. In particular, it 
is shown in a very recent paper that it is NP-hard to find roots in ¥ p for 
polynomials over F p [1]. 

Let Fps be the field with p s elements for p a prime number and s > 0. 
In the algorithms, it is given as F p [£] / (cp) where <p is a monic irreducible 
polynomial of degree s with coefficients in F„. 

Theorem 18. Let P = Y)=\ a ; -X*>(ttX + v)h £ ¥ps[X], where p > maxy(a ; - + 
/Sy) . There exists a polynomial-time deterministic algorithm to test ifP vanishes 
identically. 

The proof of this theorem is very similar to the proof of Theorem 10, 
using Theorem 17 instead of Theorem 1. The main difference occurs 
when u = or v = 0. In these cases, we rely in characteristic zero on 
an external algorithm to test sums of the form Yj a j v ^' f° r zero. This 
external algorithm does not work in positive characteristic, but these tests 
are actually much simpler. These sums can be evaluated using repeated 
squaring in time polynomial in logjSy, that is polynomial in the input 
length. 

Note that the condition p > maxy(ay + jSy) means that p has to be 
greater than the degree of P. This condition is a fairly natural condition 
for many algorithms dealing with polynomials over finite fields, especially 
prime fields, for instance for root finding algorithms [1], 

The basic operations in the algorithm are operations in the ground 
field Fp. Therefore, the result also holds if bit operations are considered. 
The only place where computations in F„s have to be performed in the 
algorithm is the tests for zero of coefficients of the form Yj C/) a j u ~ 0l i i~ v ) 
where the ttj's and £/s are integers and fly € F p s, and the sum has at most 
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k terms. The binomial coefficient is to be computed modulo p using for 
instance Lucas' Theorem [22]. 

We now turn to the problem of finding linear factors of lacunary 
bivariate polynomials. 

Theorem 19. Let P = £ ; -a ; -X a /Y^ e ¥ p s[X,Y], where p > maxfaj + ]8y). 
There exists a probabilistic polynomial-time algorithm to find all the linear factors 
ofP of the form (wX + vY + w) with uvw 7^ 0. 

Furthermore, deciding the existence of factors of the form (X — w), (Y — w) 
or (X — wY) with w 7^ is HP-hard under randomized reductions. 

Proof. The second part of the theorem is the consequence of the NP- 
hardness (under randomized reductions) of finding roots in F„s of lacunary 
univariate polynomials with coefficients in F p s [17, 1, 12]: Let Q be a lacu- 
nary univariate polynomial over Fps, and define P(X,Y) = Q(X). Then P 
has the same form as in the theorem with /3y = for all /, and factors of 
the form (X — w ) of P are in one-to-one correspondence with roots w of 
Q. Thus, detecting such factors is NP-hard under randomized reductions. 
The same applies to factors of the form (Y — w). Finally let us now define 
P as the homogeneization of Q, that is P(X, Y) = Y de s(Q)Q(X/Y). Then, 
P{wY,Y) = Y de s(Q)p(w,l) = Y de s(Q)Q(w). In other words, factors of P 
of the form (X — wY) correspond to roots w of Q. Thus detecting such 
factors is also N P-hard under randomized reduction. 

For the first part, the algorithm we propose is actually the same as in 
characteristic zero (Theorem 11). This means that it relies on known re- 
sults for factorization of dense polynomials. Yet, the only polynomial-time 
algorithms known for factorization in positive characteristic are probabilis- 
tic [26]. Therefore our algorithm is probabilistic and not deterministic as 
in characteristic zero. □ 
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A Proof of Theorem 13 

a " 

Let Pj = YViLi fi '' ror 1 < / < ^- As in the proof of Theorem i, we first 
assume that the Py's are linearly independent, and the ct^'s not less than 

(*-!)■ 

We can use a generalized Leibniz rule to compute the derivatives of 
the P/s. Namely 

jf = e ( ' t ) ncm <h» 

where (, 1 , ) is the multinomial coefficient. Consider now a derivative 

of the form This is a sum of terms, each of which contains a 

factor (The worst case happens when t different copies of / have 

been each derived once.) In Equation (10), each tj is bounded by /. This 

means that P^ = QijTIifi 1 ' ' f° r some polynomial Q/y. Since the degree 

of P- ^ equals YLid-i&ij ~ L Qi,j has degree Yj.&i&i) ~ ' ~~ Hii^i^ij ~ dfl) = 
(Lid: 1)/. 

Consider now the Wronskian W of the P/s. We can factor out in each 
column n ; fi and in each row Ox fi • At row / and column /, we 
therefore factor out l\if? rk+1 ■ EL/?" 1 "' = YlifT^- Thus, 

W 2 detM 

i=i 

where M^y = Q; r y. Thus, detM is a polynomial of degree at most (Jli — 

I)©- ' ' ' 

Therefore, the multiplicity ^w(£) of £ as a root of W is bounded by its 

multiplicity as a root of /v ' '' 2 plus the degree of det M. We get 

M£) <X> (l>/- Q)j + (E^- 1 ) 
= E E a y + ( d * - Q J ~ (2) ' 
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To conclude the proof, it remains to remember Lemma 4 and use the 
same proof technique as in Theorem 1. It was expressed in terms of the 
valuation of the polynomials, but remains valid with the multiplicity of 
a root. In this case, it can be written as ^w(£) > E; ~~ (2) where W 

is the Wronskian of the Py's. Using column operations, we can replace 
the first column of the Wronskian matrix of the Py's by the polynomial 

P and its derivatives. We get ^w(£) > Hp(£) + E/>2Hp,(£) ~~ (2)' where 
Together with (11), we get 

M£) < Fw(£) - YiVPjffl + 

;>2 

^ E ^ E a i; + - \H) Q j ~ (2) ~ E + 

< E ( ^ a *l + ( d i ~ Pi) 

It remains to remove our two assumptions. If the P/s are not linearly 
independent, we can extract a basis (P^, . . . ,P/ d ). We obtain jip(£) < 

Y^i (u-iOiifa + {di — ^0(2) ) ■ Since rf < fc + 1 — we have 

M£) ^ max E ( + ~~ 

The second assumption is that aj,- > A: — 1 for all z and /. Let 

p = p-Ylfr = L«iYlff-- 

1 i 1 

Since dCij = ocu + k — 1 > k — 1, 

m / 

< rnax J] ( ^o/y + (d,- - ft) 

= ( k ~ 1) E ft* + max ; E + ( d » ~ f »') 9 

Since ^p(£) = ^p(£) + (fc — 1) Ez the result follows. 
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Remark 20. The ordering of the Py's in the theorem is arbitrary. Yet the 
value of the bound depends on this ordering. Therefore, it is possible to 
optimize this bound by using the ordering on the Py's that minimizes the 
bound. Let us define 



The theorem states that jip(£) < maxy Sy. Let )\ < 72 such that Y^i V-i^ih > 
Yi Oi{i r Then > Sy 2 . These two terms appear in the maximum when P^ 
is before Py 2 in the ordering. If Py 1 and Py 2 are exchanged, the two terms are 



replaced by + i d i ~ Vi)( k+ \ h )) and Li(Wij 2 + ( d i ~ Fi)( k+ \ h )- 



Neither term is greater than sy x . This means that an exchange of P^ and 
Pj 2 in the ordering cannot increase the bound in the theorem. 

This proves that to minimize the bound the Py's must be ordered with 
respect to the value of ^fty/. This is consistent with the order on the ay's 
chosen in Theorem 1. We also note that the bound in Theorem 1 is exactly 
recovered as a special case. 
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